Inducing Ontologies from Folksonomies using Natural Language Understanding
نویسندگان
چکیده
Folksonomies are unsystematic, unsophisticated collections of keywords associated by social bookmarking users to web content and, despite their inconsistency problems (typographical errors, spelling variations, use of space or punctuation as delimiters, same tag applied in different context, synonymy of concepts, etc.), their popularity is increasing among Web 2.0 application developers. In this paper, in addition to eliminating folksonomic irregularities existing at the lexical, syntactic or semantic understanding levels, we propose an algorithm that automatically builds a semantic representation of the folksonomy by exploiting the tags, their social bookmarking associations (co-occurring tags) and, more importantly, the content of labeled documents. We derive the semantics of each tag, discover semantic links between the folksonomic tags and expose the underlying semantic structure of the folksonomy, thus, enabling a number of information discovery and ontology-based reasoning applications.
منابع مشابه
On The Feasibility of Open Domain Referring Expression Generation Using Large Scale Folksonomies
Generating referring expressions has received considerable attention in Natural Language Generation. In recent years we start seeing deployments of referring expression generators moving away from limited domains with custom-made ontologies. In this work, we explore the feasibility of using large scale noisy ontologies (folksonomies) for open domain referring expression generation, an important...
متن کاملSelf-adaptation of Ontologies to Folksonomies in Semantic Web
Ontologies and tagging systems are two different ways to organize the knowledge present in the current Web. In this paper we propose a simple method to model folksonomies, as tagging systems, with ontologies. We show the scalability of the method using real data sets. The modeling method is composed of a generic ontology that represents any folksonomy and an algorithm to transform the informati...
متن کاملFlexible Natural Language Access to Community-Driven Metadata
A key issue in the Semantic Web is providing easy access to metadata for non-computer scientists. We believe that natural language is the best medium for this, and that the metadata framework should be open-ended in nature. We discuss four challenges: what type of interface to use, how to associate linguistic information with ontologies, how to achieve open-endedness, and how to stimulate colla...
متن کاملUsing the Social SemanticWeb to Support eSocial Science
In this paper we discuss how aspects of the Social Semantic Web can be used in an eScience context to deliver tools to researchers in the social sciences. We explain how user requirements led us to develop a hybrid solution involving use of ontologies and folksonomies in order to document the provenance of research resources, and to situate these within their wider (social) context. A natural l...
متن کاملVocabulary Conversion : Performance with Controlled and Uncontrolled Terms and Tags Technical
Controlled and uncontrolled indexing terminology and metadata may be converted from one to another. Decision criteria are developed that can be used to determine which terms should be assigned when converting vocabularies. Methods are developed for computing the parameters of these systems, as well as means for estimating the parameters when given limited information. These conversion technique...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010